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ABSTRACT 

Targeted endonucleases including zinc finger nu- 
cleases (ZFNs) and clustered regularly interspaced 
short palindromic repeats (CRISPRs)/Cas9 are in- 
creasingly being used for genome editing in higher 
species. We therefore devised a broadly applicable 
and versatile method for increasing editing efficien- 
cies by these tools. Briefly, 2A peptide-coupled co- 
expression of fluorescent protein and nuclease was 
combined with fluorescence-activated cell sorting 
(FACS) to allow for efficient isolation of cell pop- 
ulations with increasingly higher nuclease expres- 
sion levels, which translated into increasingly higher 
genome editing rates. For ZFNs, this approach, com- 
bined with delivery of donors as single-stranded 
oligodeoxynucleotides and nucleases as messenger 
ribonucleic acid, enabled high knockin efficiencies 
in demanding applications, including biallelic codon 
conversion frequencies reaching 30-70% at high 
transfection efficiencies and ^2% at low transfection 
efficiencies, simultaneous homozygous knockin mu- 
tation of two genes with ^1.5% efficiency as well as 
generation of cell pools with almost complete codon 
conversion via three consecutive targeting and FACS 
events. Observed off-target effects were minimal, 
and when occurring, our data suggest that they 
may be counteracted by selecting intermediate nu- 
clease levels where off-target mutagenesis is low, but 
on-target mutagenesis remains relatively high. The 



method was also applicable to the CRISPR/Cas9 sys- 
tem, including CRISPR/Cas9 mutant nickase pairs, 
which exhibit low off-target mutagenesis compared 
to wild-type Cas9. 

INTRODUCTION 

Nuclease-based technologies have opened unprecedented 
possibihties for targeted genome editing in numerous 
species and cell types previously found challenging for ge- 
netic modification. The general principle involves engineer- 
ing of endonucleases that can create a double-strand break 
at a desired site in genomic deoxyribonucleic acid (DNA) 
and vastly stimulate mutagenesis rates at that site. The 
technology may exploit natural homing endonucleases with 
specificities redirected towards a desired genomic sequence 
(1); alternatively, it may exploit non-specific nucleases, such 
as Fokl, that are targeted to a desired genomic location 
via fusion to protein modules engineered to bind a specific 
DNA sequence. The latter systems include zinc finger nu- 
cleases (ZFNs) (2,3) and transcription activator-like type 
II effector nucleases (TALENs) (4). ZFNs and TALENs 
function as heterodimers in which the individual monomers 
bind offset 9-18-bp target sequences on opposite strands of 
DNA and subsequently nick their respective strands to pro- 
duce a double-strand break. Recently, clustered regularly 
interspaced short palindromic repeat (CRISPR) systems for 
genome editing have been developed to introduce a double- 
strand break by the non-specific nuclease Cas9, which is di- 
rected to the desired locus by a 20-nt sequence contained 
within a so-called guide ribonucleic acid (gRNA) through 
Watson-Crick base pairing with target DNA (5-1 1). Most 
recently, pairs of gRNAs that target offset sequences on op- 
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posite strands of the target locus have been used in conjunc- 
tion with nickase mutants of Cas9. This represents an edit- 
ing system that is analogous to that of ZFNs and TALENs 
and shows greatly increased specificity as compared to the 
single CRISPR/Cas9 approach (12-14). 

No matter the type of engineered nuclease used, the 
ultimate goal is to produce a site-specific DNA double- 
strand break. Such breaks can be resolved via the relatively 
error-prone non-homologous end joining (NHEJ) pathway, 
which often inserts or deletes a number of bases at the 
break. If nucleases are targeted to a coding sequence, a 
frame shift and functional gene knockout may be the out- 
come. Alternatively, the DNA break can be repaired by the 
homology-directed repair (HDR) pathway using the sister 
chromatid as repair template. However, if an exogenous, ho- 
mologous DNA template (donor) containing a mutation is 
co-delivered into cells along with the nucleases, HDR can 
be exploited to precisely modify a genome in a user-defined 
manner. Short, homologous single-stranded oHgodeoxynu- 
cleotides (ssODNs) have also proven highly effective donors 
(15), exploiting repair mechanisms that are not entirely 
clear. 

The efficiency of nuclease-based generation of genome- 
edited clones from a targeted cell population is affected 
by several factors. One critical determinant is nuclease ex- 
pression levels. Nucleases are most often delivered to cul- 
tured cells by transfection of plasmid- or messenger ribonu- 
cleic acid (mRNA)-based expression constructs and less fre- 
quently via viral or protein delivery (16-19). Regardless 
of the method, nuclease delivery efficiencies and the resul- 
tant expression levels vary greatly between cell types. Even 
within a given cell population, nuclease expression levels of- 
ten vary substantially. Consequently, low nuclease expres- 
sion levels in individual cells and/or nuclease expression in 
only a small fraction of cells often represent a major bar- 
rier to the generation of modified clones from a targeted 
cell population. 

Expression of fluorescent proteins followed by 
fluorescence-activated cell sorting (FACS) is a pow- 
erful method for tracking cells of interest in a mixed 
population and has also been explored for nuclease genome 
editing. For instance, a fluorescence-based surrogate target 
gene reporter was co-transfected along with the nucleases 
and was used to enrich for cells with high nuclease activity 
(20). Furthermore, elegantly designed fluorescence-based 
reporter systems were used to explore repair mechanisms 
underlying nuclease genome editing, showing that elevated 
nuclease levels promote NHEJ and HDR and that elevated 
donor levels increase HDR whilst suppressing NHEJ (21). 

Here, we aimed to devise a FACS-based method for ob- 
taining cell populations with desired, high and uniform nu- 
clease expression levels from which genome-edited clones 
may be derived with increased efficiencies. The method was 
based on tightly coupled (1:1) co-expression of nuclease and 
fluorescent protein in the same cell. This was achieved via 
transfection with constructs in which nuclease and fluores- 
cent protein are encoded in the same transcript and linked 
by 2A peptide sequence (22), causing expression of nucle- 
ase and fluorescent protein as separate entities. Subsequent 
FACS allowed for easy isolation of cell populations with 
increasingly higher and uniform nuclease expression level 



which translated into increasingly higher genome editing 
frequencies. 

We explored this method in several variations that offer 
specific advantages depending on the application: linking 
the two nucleases of a pair to distinct fluorescent proteins 
for standard genome editing, linking nuclease pairs for dis- 
tinct genes to distinct fluorescent proteins for multiplex edit- 
ing, and consecutive rounds of fluorescent protein/nuclease 
delivery and FACS for generation of genome edited cell 
pools. In the diverse and often challenging applications 
tested, this method was found to greatly increase genome 
editing frequencies by ZFNs and CRISPR/Cas9. By com- 
bining the method with delivery of ZFNs as mRNA and 
donors as ssODNs, generation of biallelic knockin clones 
in large numbers was achieved. Observed ZFN off-target 
effects associated with the method were minimal. Further- 
more, in the event that an off-target effect is encountered, 
the method may be exploited to balance ZFNs to levels 
where off-target mutagenesis is low, but on-target mutagen- 
esis remains sufficiently high. For CRISPR/Cas9, off-target 
effects may be reduced by incorporation of nickase pairs 
into the method. 

MATERIALS AND METHODS 

ZFNs and donors 

ZFNs were of the CompoZr ZFN format generated by 
Sigma-Aldrich, except for the RSK4 ZFNs that were engi- 
neered by Sangamo Biosciences, Inc. The ZFNs contain the 
eHiFi (ELD/KKR) Fokl domains and function as obligate 
heterodimers with improved catalytic activity (23). The zinc 
finger moieties of the dimers were designed to target 30- 
33-bp recognition sites in total. The sequences of the major 
ZFNs used here {RSK2, RSK4 and PRMTl) are given in 
Supplementary Table SI . Plasmid and ssODN donors (Sup- 
plementary Table S2) were synthesized by Gene Oracle Inc. 
and Sigma-Aldrich (Genosys), respectively. 

pFluo-2A-ZFN constructs 

First, a so-called pFluo-2A-ZFN construct (Figure 1 A) was 
generated to co-express enhanced green-fluorescent pro- 
tein (EGFP) (24) and ZFN via the 2A peptide. Briefly, the 
EGFP-2A-FLAG-NLS sequence shown in Supplementary 
Figure S 1 and starting with an EcoRI site and ending with 
a Kpnl site was synthesized (Gene Oracle, Inc.). This se- 
quence was then used to replace the FLAG tag and nuclear 
localization signal (NLS) sequences excised from the stan- 
dard CompoZr ZFN expression construct, pZFN (which 
in this case encoded the left ZFN for a ZFN pair target- 
ing human RSK4), using the same restriction endonuclease 
sites. The resulting construct was used to generate all other 
pFluo-2A-ZFN derivatives reported here for co-expression 
of EGFP, the DsRed2 derivative E2 Crimson (25), Venus 
(26) or Cerulean (27) together with various ZFNs. First, 
fluorescent proteins were swapped by polymerase chain re- 
action (PCR) amplification of the fluorescent protein cod- 
ing sequence using up- and downstream primers with over- 
hangs containing Nsil and Avrll sites, respectively, followed 
by cloning of the PCR product into the pEGFP-2A-ZFN 
construct for RSK4 using these sites. Second, the various 
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fluorescent protein-2A-FLAG-NLS cassettes were cloned 
into pZFNs for various genes using the EcoRI/Kpnl sites, 
a cloning strategy that works for any CompoZr pZFN con- 
struct as long as these sites are absent from the zinc finger 
moiety of the construct. 

pFP-ZFN and pFP-ZFNL-ZFNR constructs 

Alternative versions of pFluo-2A-ZFN, called pFP-ZFN, 
were created. First, pFP-ZFNL and pFP-ZFNR constructs 
were generated that encode green-fluorescent protein (GFP) 
(28,29) and red fluorescent protein (RFP) (30), respectively, 
linked to ZFN via a 2A sequence (Supplementary Figure 
S7). Fluorescent proteins and ZFNs can be swapped using 
the Nsil and KpnI/XhoI restriction sites, respectively. To 
combine both ZFNs of a pair into a single, GFP-coupled 
construct, the 2A-ZFNR sequence can be excised from a 
pFP-ZFNR vector using the restriction enzymes Bglll and 
Xhol and then inserted into a pGFP-ZFNL construct di- 
gested with the same enzymes. The resulting pFP-ZFNL- 
ZFNR construct (Figure 3A) contains intervening 2A se- 
quences for co-expression of fluorescent protein, ZFNL and 
ZFNR as three separate entities. 

CRISPR/Cas9 constructs 

A single vector construct, pCMV-Cas9-GFP, was gene- 
synthesized to co-expresses gRNA targeting the KRAS 
or the VEGFA locus under control of the U6 promoter 
and Cas9-2A-GFP under control of the cytomegalovirus 
(CMV) promoter (Figure 5A). QuikChange mutagenesis 
(Stratagene) was used to swap the gRNA in these con- 
structs to gRNAs targeting the EMXl locus as well as to 
convert Cas9 into a DNA nickase (Cas9N) by introduction 
of the DlOA point mutation. Sequences of gRNAs and 
mutagenesis primers are provided in Supplementary Tables 
S3 and S4, respectively. QuikChange conditions were 
standard, except that the extension time was 16 min and 
18 cycles were performed to obtain robust amplification. 
Furthermore, 5% dimethyl sulfoxide (DMSO) was included 
in the reaction mixture. To swap GFP with DsRed2 in 
these constructs, a MegaQuikChange strategy was em- 
ployed. Briefly, the DsRed2 sequence was PGR amplified 
using the following primers: 5^GGAGAATCCTGGCC 
CACGATCGATGCATGATAGCACTGAGAACG-3^; 
S^CCTCTAGACTCGAGTCATCAGTTAACCTGGAA 
CAGGTGGTGGCGGG-3^ The PGR product was puri- 
fied and 200 ng were used as MegaQuikChange primers 
using pCMV-Cas9N-GFP as template. 

In vitro synthesis of mRNA 

mRNA was in vitro synthesized from pZFN or pFluo-2A- 
ZFN using the MessageMAX T7 ARCA-Capped Message 
Transcription Kit (#EPICMMA60710, Epicentre) and the 
Poly A tailing kit (#EPICPAP5104H, Epicentre) and puri- 
fied using the MEGAClear kit (Ambion) according to the 
manufacturers' instructions. 

Cell culture 

K562 and HCTl 16 cells were cultured in Iscove's modified 
Dulbecco's medium and McCoy's 5A medium, respectively. 



supplemented with 10% foetal bovine serum. MCFIOA cells 
were maintained in Ham's nutrient mixture F12/Dulbecco's 
modified Eagle's medium (1:1) supplemented with 5% horse 
serum, 10-|xg/ml insulin, 20-ng/ml epidermal growth fac- 
tor (EGF), 5-|xg/ml hydrocortisone and 100-ng/ml cholera 
toxin. Jurkat cells were cultured in Ro swell Park Memorial 
Institute- 1640 medium supplemented with 10% fetal bovine 
serum. Chinese hamster ovary (CHO) cells were maintained 
in EX-CELL CHO CD Fusion serum-free medium (Sigma- 
Aldrich). All media were supplemented with 1.5-mM L- 
glutamine (4 mM for CHO cells), 100-U/ml penicillin and 
100-fxg/ml streptomycin. K562- and Jurkat-conditioned 
media were prepared by mixing equal amounts of fresh 
medium with medium collected from either cell type cul- 
tured at ~0.5 X 10^ cells per ml for 24 h then sterilized using 
a 0.2-|jLm filter (Millipore). 

Expression comparison of ZFNs from pFluo-2A-ZFN versus 
pZFN 

For plasmid transfection, HCTl 16 cells were plated at 1.2 
X 10^ cells per 3.8 cm^ well. After 24 h, the cells were trans- 
fected with 0.4-|jLg DNA complexed with 2-(jl1 TransIT- 
LTl transfection reagent (#MIR2310, Mirus) according 
to the manufacturer's instructions. For mRNA transfec- 
tion, HCTl 16 cells were split and plated at a density lead- 
ing to ~70% confluency at the time of transfection 2 days 
later. Cells were then electroporated using a Nucleofec- 
tor (Lonza) set to program D-032. 1 x 10^ cells were dis- 
solved in 100-|jl1 Nucleofector Solution V (Lonza) with 2.0- 
|xg and 3.3-|jLg mRNA transcribed from pZFN and pFluo- 
2A-ZFN, respectively. The difference in mRNA amounts 
used was due to the size difference of the mRNA con- 
structs (~1170 and ~1950 bps, respectively, plus a poly A 
tail of ~150 bps) and ensured transfection with roughly 
equimolar amounts of both mRNA types. Three days post- 
transfection, cells were lysed in sodium dodecyl sulphate- 
polyacrylamide gel electrophoresis (SDS-PAGE) sample 
buffer (2% SDS, 62-mM Tris-HCl pH 6.8, 10% glycerol, 50- 
mM dithiothreitol and 0.12% bromophenol blue) and sub- 
jected to immunoblotting, as described below. 

Functional comparison of ZFNs expressed from pFluo-2A- 
ZFN versus pZFN 

K562 cells were split 2 days prior to electroporation so that 
the concentration of the cells was ~0.5 x 10^ cells per ml 
at the time of electroporation. The K562 electroporation 
was performed in a Nucleofector (Lonza) using program 
T-016 according to the manufacturer's instructions. Briefly, 
1 X 10^ cells in 100-|jl1 Nucleofector Solution V (Lonza) 
were mixed with the following reagents as appropriate for 
the experiment: for ssODN donor-based genome editing, 
3 |jl1 of 100-|jlM RSK4 Cys443Val/BamHI ssODN donor 
(equaling ~6.7 (xg) in 10-mM Tris (pH 7.6) was used to- 
gether with 2-(jLg ZFN-expressing plasmid (for each ZFN of 
the pair) or with 2-(jLg and 3.3-fxg ZFN-expressing mRNA 
(for each ZFN of the pair), when transcribed from pZFN 
and pFluo-2A-ZFN, respectively. For plasmid donor-based 
genome editing, 10-fxg RSK2 Cys436Val/BamHI plasmid 
donor was used together with 2-|jLg ZFN-expressing mRNA 
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when transcribed from pZFN or 3.3-|jLg ZFN-expressing 
mRNA when transcribed from pFluo-2A-ZFN. Four hours 
after nucleofection, cells were shifted to 30° C. Whilst a ben- 
eficial effect of this condition termed 'cold shock' has so far 
only been vahdated for NHEJ editing, we used it also for 
knockin editing, since it was reported to increase cellular 
ZFN levels (31). After 3 days, genomic DNA was isolated 
as described below. 

Genome editing using 2A-coupled co-expression of nuclease 
and fluorescent proteins combined with FACS 

K562 cells were electroporated with the various constructs 
for ZFN or CRISPR/Cas9 2A-linked to fluorescent pro- 
tein, as described in the section above and in the main text. 
Cefls were then cultured for 72 h with a 30°C cold shock 
from 4-68 h (ZFNs only). MCFIOA and Jurkat cells were 
electroporated as described for K562 except that Nucleofec- 
tor solution T/Nucleofector program T-020 and Nucleofec- 
tor Solution V/Nucleofector program X-001 were used for 
the two cell lines, respectively. In the comparison of the effi- 
ciency of ZFN genome editing using non-linked versus 2A- 
linked fluorescent protein, the non-linked fluorescent pro- 
tein was delivered as 250 ng of pGFP plasmid (contained in 
the Nucleofector kit from Lonza). 

Three days after transfection, the cells were subjected to 
FACS analysis. Briefly, cells (trypsinized if adherent and 
then diluted with culture medium to inactivate trypsin) were 
pelleted by centrifugation at 200 g for 4 min and resus- 
pended in normal culture medium to about 0.5-1 x 10^ cells 
per ml (NB: optimal cell densities will vary depending on 
cell type and whether bulk sorting or single-cell plating is 
the goal). Finally, the cells were passed through 50- [xm fil- 
cons (BD Biosciences) to achieve a single-cefl suspension. 
FACS analysis was performed on an FACS Aria I (BD Bio- 
sciences) with excitation and detection of emission as ap- 
propriate for the various fluorescent proteins. Fluorescence 
levels of mock-transfected cells were determined in parallel 
and considered background level, based on which cell pop- 
ulations with specific fluorescence signals could be assigned 
(see Supplementary Figure S5 for further details). Fluores- 
cent cell populations were then gated in defined fluorescence 
intensity intervals (e.g. 50% ib 5% or top 10%) of the total 
fluorescent cell population and isolated. To generate stably 
modified clones, cells isolated by FACS according to a de- 
sired fluorescence level or cells not subjected to FACS were 
single-cell sorted into 96-well plates in medium conditioned 
as appropriate and expanded to clonal cell lines. 

Genomic DNA extraction and PGR amplification 

For single-cell clones, DNA was isolated when the expanded 
cell clones grew dense. Briefly, the cells were washed with a 
IX phosphate-buffered saline (PBS) solution, lysed in 70- 
|jl1 lysis buffer [25-mM NaOH, 0.2-mM ethylenediaminete- 
traacetic acid (EDTA), pH 12], then incubated at 95°C 
for 30 min. Thereafter, 70-fxl neutralization buffer (40-mM 
Tris-HCL, pH 5) were added. Of this mixture, 1-3 |xl were 
used as template for PGR for genome editing analysis by re- 
striction fragment length polymorphism (RFLP) assay, as 
described below. 



For bulk-sorted cells, genomic DNA was isolated by in- 
cubation of the cells for 3 h at 55°C in lysis buffer [20-mM 
Tris-HCl (pH 8.0), 5-mM EDTA, 400-mM NaCl, 1% SDS, 
400-|xg/ml proteinase K], foflowed by phenol-chloroform 
extraction and isopropanol precipitation of the DNA. Ap- 
proximately 100 ng of genomic DNA were then used as tem- 
plate for PGR amplification and genome editing analysis by 
GEL-I or RFLP assays, as described below. PGR was car- 
ried out in 12.5-|jl1 reactions using Taq DNA polymerase 
(Ampliqon). The specific anneahng temperatures and addi- 
tives for the various PGRs are listed in Supplementary Table 
S5. The following touch-down PGR amplification condi- 
tions were used: after an initial 5-min denaturation at 95°C, 
samples were subjected to 15 cycles consisting of denatura- 
tion for 30 s at 95°C, then primer annealing and extension 
for 70 s at 70°C in the first cycle with a 0.5°C reduction of 
annealing/extension temperature in each subsequent cycle. 
This was followed by 23-26 cycles of denaturation for 30 s 
at 95°G, primer annealing for 30 s at various temperatures 
(provided in Supplementary Table S5) and extension for 70 
s at 72° C. The reactions were terminated by a final elonga- 
tion step of 5 min at 72° C. 

GEL-I assay 

GEL-I assays were performed according to the manufac- 
turer's instructions (Transgenomic). Briefly, 10 |xl of the 
PGR products amplified from wild-type and mutant alle- 
les were used to generate heteroduplexes using the follow- 
ing reaction conditions: an initial 10-min denaturation step 
at 95° C was followed by 1 s at 95° C and a step-wise 2°C re- 
duction in temperature with 1-s incubations down to 85°C, 
then 1 s at 85°G and a step-wise 0.1°C reduction in tem- 
perature with 2-s incubations down to 4°C. Next, the reac- 
tions were incubated at 42°C for 30 min with 0.4 (xl of SUR- 
VEYOR Nuclease S (GEL-I) and SURVEYOR Enhancer S 
(#706025, Transgenomic). One-tenth of the PGR volume of 
Stop Solution was added to terminate the reactions. Finally, 
the digests were resolved on 6% or 8% Tris-borate-EDTA 
(TBE) acrylamide gels, depending on the anticipated band 
sizes. Supplementary Table S5 provides the sequences of the 
primers used, as well as the sizes of diagnostic fragments 
and PGR products amplified from the targeted alleles. 

RFLP assay 

PGR reactions were digested by addition of 10 U of 
the appropriate restriction endonuclease and incubation 
overnight at 37°G. Digests were resolved on either 6% or 
8% TBE acrylamide gels, or on 1-2% 0.5 x Tris-acetate- 
EDTA (TAE) agarose gels, depending on the anticipated 
band sizes. When analysing clonal cell lines, mono- and 
biallelic knockin of diagnostic restriction sites in diploid 
loci were evidenced by ~50% and complete digestion of 
the PGR products, respectively (see Supplementary Figure 
S4 as an example). For triploid loci, triallelic knockin was 
evidenced as complete digestion, whereas partial digestion 
(~30-60%) was taken as mono- or biallelic knockin. Sup- 
plementary Table S5 provides the diagnostic restriction en- 
donucleases, sequences of the primers used, as well as the 
sizes of diagnostic fragments and PGR products amplified 
from the targeted alleles. 
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Densitometric quantization of CEL-I and RFLP assays 

Gels with digested PGR products were stained for 10 min 
with ethidium bromide and imaged with a Quantity One 
Gel Doc XR imaging system (Bio-Rad). Quantification was 
performed using Image J software (Rasband, W.S., Image J, 
U.S. National Institutes of Health, Bethesda, Maryland, 
USA, http://imagej.nih.gov/ij/, 1997-2012) and based on 
relative intensities of uncleaved versus cleaved PGR prod- 
ucts, and percentage cleavage was determined using the for- 
mula: % cleavage = 100 x (1-(1 -fraction cleaved) ^/^). 

Off-target analysis 

To predict candidate off-target sites for the human RSK4 
and RSK2 ZFNs, the Homo sapiens genome (GRCh37, 
hgl9) was scanned for sequences highly homologous to 
their on-target sites. To predict candidate off-target sites for 
the various CHO ZFNs, the Cricetulus griseus genome ver- 
sion 1 .0 was downloaded from NCBI (ID: AFTDOIOOOOOO) 
and indexed for use with Bowtie (32) for alignment with 
the relevant ZFN on-target sites. The Phred score thresh- 
old was set to 160, which in our case thus searched for sites 
with up to five mismatches per monomer binding site. Fi- 
nally, 100-bps up- and downstream of the candidate sites 
were extracted for primer design for genomic PGR. For 
both human and hamster, a 5- or a 6-bp spacer sequence 
between the ZFN monomer binding sites was allowed. Fur- 
thermore, searches were restricted to heterodimeric loca- 
tions since the ZFNs used contain obligate heterodimeric 
Fokl (ELD/KKR) domains, which have previously been 
shown to eliminate off-target activity at homodimeric tar- 
get locations (33). 

To perform Sanger sequencing of candidate off-target 
sites, the appropriate genomic loci were PGR amplified us- 
ing primers listed in Supplementary Table S5. The result- 
ing PGR products were then purified using carboxyl-coated 
magnetic beads. Briefly, an equal volume of PGR product 
and hybridization buffer (2.5-M NaGl, 20% polyethylene 
glycol (PEG) 8000) supplemented with 10 |jl1 of 10 mg/ml 
beads pre-washed three times with 0.5-M EDTA (#21353, 
Pierce) were mixed and incubated for 10 min at room tem- 
perature. Then, magnetic beads with bound PGR products 
were captured using a magnetic stand for 5 min and washed 
three times with 70% ethanol. The beads were air-dried and 
re-suspended in 20 fxl 10-mM Tris-HGl pH 8.0 buffer. For 
sequencing (GATG Biotech), 20-80 ng/(xl of the purified 
PGR products were pre-mixed with 25 pmol (in 5 |jl1) of the 
appropriate sequencing primers (Supplementary Table S5). 

Gene copy number analysis 

Genomic DNA was extracted from K562 and Jurkat cells 
using the FlexiGene DNA kit (Qiagen) and subjected to 
gene copy number analysis (performed at the Genter for Ge- 
nomic Medicine, Rigshospitalet) using GytoScan HD ar- 
rays (Affymetrix). Briefly, 250 ng of genomic DNA were 
digested and ligated to adapters. Adapter-ligated DNA 
was amplified, purified, fragmented, labelled with biotin 
and hybridized to the arrays for 18 h. The Fluidics Sta- 
tion 450 (Affymetrix) and the GeneGhip Scanner 3000 7G 
(Affymetrix) were used to wash, stain and scan the arrays. 



The generated .GEL files were then analysed in Partek Ge- 
nomics Suite 6.6. Briefly, data were imported using the rec- 
ommended default Partek settings: probes were adjusted 
for probe sequence, settings for no background and quan- 
tile normalization were applied and probes were allele- 
specifically summarized. Gene copy number and single nu- 
cleotide polymorphism (SNP) allele ratio were calculated 
using an unpaired analysis with the basehne distributed by 
Partek, which is based on 380 GytoScan HD array from the 
International HapMap project. Gopy number status of in- 
dividual genes was estabhshed by examining copy number 
calls combined with the SNP allele ratios, whilst also taking 
polyploidy into account. 

Stimulation and isolation of RSK2 for functional codon con- 
version assay 

To assay RSK2 activity, wild-type or RSK2-Gys436Val 
K562 cells plated at 2 x 10^ cells per well were serum-starved 
for 4 h and then incubated for 1 h with 10-|xM fmk. A fi- 
nal 20-min incubation with 100-nM phorbol 12-myristate 
13-acetate (PMA) was included, when indicated. The cells 
were solubilized for 15 min in 1-ml lysis buffer [300-mM 
NaGl, 50-mM Tris-HGl (pH 7.4), 0.5% Igepal GA-630, 1- 
mM Na3V04, 5-mM EDTA, 50-mM NaF, 10-nM calyculin 
A, 10-|xM leupeptin, 5-|jlM pepstatin and l-|jLg/ml apro- 
tinin] on ice and manipulated at <4°G thereafter. Gell ex- 
tracts were clarified by centrifugation for 5 min at 1 8 000 g 
and the supernatants were incubated for 90 min with 2 (xg of 
anti-RSK2 antibody (MGA3429Z, ABG Serotec), with the 
addition of 20-fxl protein G agarose beads (#16-266, Up- 
state) during the final 30 min. The beads were then precipi- 
tated by centrifugation, washed five times with lysis buffer, 
drained and dissolved in SDS-PAGE sample buffer. 

Immunoblot analysis 

Immunoprecipitated RSK2 and extracts of cells transfected 
with ZFN and fluorescent protein expression constructs 
were subjected to SDS-PAGE and the proteins were then 
transferred to Hybond-EGL nitrocellulose membranes (GE 
Healthcare). Membranes were blocked with 2% (w/v) EGL 
Advance Blocking Agent (GE Healthcare), followed by in- 
cubation with primary antibodies against active (Ser386- 
phosphorylated) RSK2 (#AF1094, R&D Systems), total 
RSK2 (MGA3429Z, ABG Serotec), FLAG epitope tag 
(horse radish peroxidase-coupled, #F3165, Sigma), EGFP 
(#8334, Santa Gruz Biotechnology; this antibody was also 
used to detect Venus and Gerulean) or DsRed2 antibody 
(#632496, Glontech), as appropriate. Thereafter, relevant 
horseradish peroxidase-coupled secondary antibodies and 
EGL Advance (GE Healthcare) were used for visualization 
of the primary antibodies. 

RESULTS 

Generation of a plasmid construct (pFluo-2A-ZFN) for 
tightly coupled co-expression of fluorescent protein and ZFN 
as separate entities 

We first created the construct pFluo-2A-ZFN that encodes 
a fluorescent protein hnked to a ZFN via a 2A peptide se- 
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Figure 1. High-efficiency genome editing via 2A-coupled co-expression of ZFNs and fluorescent proteins combined with FACS. (A) Schematic of the 
pFluo-2A-ZFN construct for 2A-coupled, 1 : 1 co-expression of ZFN and fluorescent protein. Functional parts indicated are CMV promotor, fluorescent 
protein (Venus, Cerulean, DsRed2, EGFP), 2A peptide, triple FLAG epitope tag, nuclear localization signal (NLS), zinc finger (ZF) moiety and Fokl 
nuclease moiety The restriction endonuclease sites Nsil/Avrll and EcoRI/Kpnl can be used to swap fluorescent protein and ZFN, respectively (B) Fluo- 
rescent protein and ZFN expressed from Fluo-2A-ZFN transcripts are co-expressed as separate entities due to translational skipping at the 2A sequence. 
(C) Overview of strategy for high-efficiency genome editing via FACS isolation of cells that co-express fluorescent protein and ZFNs. In this example, left 
and right ZFNs of a pair (ZFNL and ZFNR) have been coupled to distinct fluorescent proteins. (D) FACS profile of K562 cells 3 days after transfection 
with mRNA for EGFP-2A-ZFNL and DsRed2-2A-ZFNR targeting the RSK4 locus along with an ssODN knockin donor (RSK4 Cys443Val/BamHI). 
Cells from populations displaying low to high EGFP/DsRed2 double-fluorescence intensities {a)-(e) were FACS isolated and single-cell seeded for clonal 
expansion. Cells in the gate Q3 have fluorescence intensities similar to mock-transfected cells. (E) Frequency of monoallelic and biallelic knockin modifi- 
cation at (0 the diploid RSK4 locus in clonal cell lines derived from populations (fl)-(e) or at (//) the diploid RSK2 locus in a similar knockin experiment 
using reagents for RSK2 and analysing populations (a) and (c) only. Except for population (e) that was isolated only once, the data are means + SD of 
three independent experiments. (F) Functional codon conversion at the RSK2 locus. Wild-type or clonal K562 cells with biallelic Cys436Val knockin 
modification of RSK2 derived from (c) in panel (E) were analysed for sensitivity of RSK2 to fmk by immunoblot analysis after exposure of cells to the 
stimulus PMA in the absence or presence of fmk. The data are representative of those obtained with five clones analysed. (G) Genomic sequence of an 
RSK2 Cys436Val clone, indicating the ZFN-binding sites (black bars), the Val436 codon (red bar) and the diagnostic Sfcl site (green bar). The mutations 
introduced are boxed. Those occurring in the ZFN target site aimed to prevent re-cutting of the locus once the donor had been knocked in. 
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quence in one single reading frame (Figure 1 A and Supple- 
mentary Figure SI). The 2A peptide undergoes a transla- 
tional ribosomal skip of the glycyl-prolyl peptide bond at 
its C-terminus, which results in the expression of fluores- 
cent protein and ZFN as two separate entities (Figure IB). 
Of note, 2A-based approaches are superior with respect to 
producing 1 : 1 co-expression levels of proteins of interest, 
as compared to other co-expression methods (22). We de- 
signed pFluo-2A-ZFN such that a single cloning step can 
swap fluorescent protein (including EGFP, Venus, DsRed2 
and Cerulean) by using the sites Nsil/Avrll and ZFN by 
using the sites EcoRI/KpnI. 

pFluo-2A-ZFN can co-express diverse fluorescent proteins 
and ZFNs at normal expression and activity levels 

Next, we determined whether pFluo-2A-ZFN expresses the 
fluorescent protein and ZFN coding sequences as sepa- 
rate entities and whether ZFN expression and activity lev- 
els were comparable to those obtained with the CompoZr 
pZFN construct from which we derived pFluo-2A-ZFN. 
HCT116 cells were transfected with pZFN or pFluo-2A- 
ZFN encoding a variety of fluorescent proteins linked to ei- 
ther monomer of various ZFN pairs, denominated ZFN left 
(ZFNL) and ZFN right (ZFNR). Two days after transfec- 
tion, the cells were subjected to immunoblot analysis for the 
fluorescent protein or the FLAG epitope tag present in the 
ZFNs. In all instances, pFluo-2A-ZFN expressed the ZFN 
moieties cleaved from fluorescent protein at levels similar to 
those expressed from pZFN (Supplementary Figure S2A). 

Whereas the above experiments used plasmid-based ZFN 
delivery, ZFN coding sequences can also be delivered into 
cells as mRNA synthesized in vitro from vectors that con- 
tain the appropriate elements. When delivered as mRNA, 
Fluo-2A-ZFN constructs also expressed cleaved ZFNs at 
levels identical to those derived from pZFN (Supplemen- 
tary Figure S2B). Furthermore, pFluo-2A-ZFN was of 
comparable efficiency to pZFN in serving as a template for 
in vitro synthesis of mRNA (Supplementary Figure S2C). 

To test whether Fluo-2A-ZFN expresses ZFNs that func- 
tion effectively, human leukemic K562 cells were transfected 
with various derivatives of this construct, either as plas- 
mid or mRNA. Along with the ZFN constructs, we co- 
transfected plasmid or ssODN donor aiming to introduce 
a specific codon-swap mutation as well as a silent diagnos- 
tic restriction endonuclease site in the targeted genes (the 
sequences of donors are provided in Supplementary Table 

52) . Three days post-transfection, the cells were lysed and 
the cell lysates were used as template to PGR amphfy ge- 
nomic sequence spanning the mutation sites. The resulting 
PGR products were subjected to two different assays that 
both detect mutagenesis at the target site: either GEL-I as- 
say that detects any alterations at the target site or RFLP 
assay that specifically detects knockin of the diagnostic re- 
striction site. Both assays revealed that ZFNs derived from 
either pZFN or pFluo-2A-ZFN produced similar levels of 
genome editing. This was observed regardless of whether 
the ZFNs were delivered as plasmid or mRNA or whether 
the donor was plasmid or ssODN (Supplementary Figure 

53) . Note that these knockin experiments with plasmid and 
mRNA ZFN constructs were performed on different oc- 



casions and cannot be compared. In side-by-side compar- 
isons, we find that mRNA constructs are often the more ef- 
ficient for knockin, as we reported previously (15). 

Controlled low to high knockin editing frequencies by com- 
bining 2A-coupled co-expression of fluorescent proteins and 
ZFNs with FACS 

We next tested whether transfection with Fluo-2A-ZFN 
constructs followed by FAGS for isolation of ZFN trans- 
fectants can be used to increase the efficiency of genome 
editing, as assessed in clonal cell lines. We first focused on 
knockin editing, which is relatively demanding, and chose 
to deliver donors as ssODNs and nuclease constructs as 
mRNA, since these delivery forms offer various advantages 
over double- strand DNA construct delivery, as detailed in 
the Discussion section. 

First, we transfected K562 cells with mRNA constructs 
in which each ZFN of a pair was 2A-coupled to a dis- 
tinct fluorescent protein: EGFP for ZFNL and DsRed2 
for ZFNR. The ZFNs targeted the RSK4 locus which was 
found to be diploid in these cells by Gy to Scan HD array- 
based gene copy number analysis (Supplementary Table 
S6). Along with the ZFNs, we co-transfected an ssODN 
donor {RSK4 Gys443Val/BamHI) designed to knockin a 
specific codon-swap mutation and a silent diagnostic re- 
striction site in RSK4. FAGS analysis 3 days after transfec- 
tion revealed that the expression levels of both EGFP and 
DsRed2 varied ~ 100-fold in the cell population (Figure ID, 
Q2), illustrating the great lack of uniform expression in cells 
of a transfected cell pool. Also, a fraction of the fluores- 
cent cells were DsRed2 positive only (Figure ID, Q4: ~1.7% 
of the cells), thus apparently expressing only one ZFN of 
the pair, which is incompatible with respect to genome edit- 
ing. Using FAGS, we next isolated cell populations that 
displayed uniform EGFP and DsRed2 double-fluorescence 
levels ranging from low to high (Figure ID, a-e), aiming to 
recapitulate the finding in the model system by Gerto et al 
that increased nuclease levels result in increased editing fre- 
quencies (21). Thereafter, cells from each population were 
single-cell sorted into 96-well plates and expanded to clonal 
cell lines. The expanded clones were analysed using RFLP 
assay to detect knockin of the diagnostic restriction site in a 
monoallelic or biallelic state at the RSK4 locus (Supplemen- 
tary Figure S4). Three striking results were obtained [Fig- 
ure IE, (/)]. First, the knockin frequencies in the clones in- 
creased with the fluorescence level observed in the cells at 
the time of FAGS and single-cell plating. Second, ~95% of 
the 264 clones derived from the most highly fluorescent cell 
population 'e' had the diagnostic knockin mutation and, of 
these, ~70% had biallelic modification of the targeted lo- 
cus. Finally, a total of 520 biallelic knockin clonal cell lines 
were rapidly generated using this high-throughput genome 
editing method that was streamlined for 96-well plate for- 
mat and multi-channel pipette handhng from FAGS to the 
final step of genotyping of clones by RFLP assay. 

In a similar experiment, we targeted the RSK2 locus 
in K562 cells using mRNA for ZFNL coupled to EGFP 
and ZFNR coupled to DsRed2. Along with ZFNs, we co- 
transfected an ssODN donor {RSK2 Gys436Val/SfcI) de- 
signed to knockin a codon-swap mutation and a silent di- 
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agnostic restriction site in the RSK2 locus, which was found 
to be diploid in K562 cells by gene copy number analysis 
(Supplementary Table S6). Three days after transfection, 
EGFP and DsRed2 double-fluorescent cells were FACS iso- 
lated from populations with fluorescence levels similar to 
'a' and 'cm Figure ID, single-cell plated and expanded to 
clonal cell lines. The resulting cell lines were shown to har- 
bour low (~0.6%) and high (~64%) stable knockin mod- 
ification, respectively, of the RSK2 locus [Figure IE, (//)]. 
Of note, biallelic knockin clones constituted ~29% of those 
derived from population 'c'. By contrast, in an earlier study 
using the same reagents (except for minor differences in the 
donor), but where we did not implement FACS enrichment, 
only 2.7% biallelic knockin modification of the RSK2 locus 
was obtained (15). According to RNA-seq analysis, RSK4 
is expressed at > 1000 times lower levels compared to RSK2 
in K562 cells (our unpubhshed data), yet the two genes were 
knockin modified with rather similar efficiencies. 

We developed the present high-throughput knockin 
method with the aim of genetically manipulating the phar- 
macological sensitivity of many different protein kinases in 
diverse cell lines. For instance, the Cys436Val codon-swap 
knockin mutation introduced in RSK2 in Figure IE aimed 
at rendering this kinase insensitive to the small-molecule in- 
hibitor fmk. We tested whether such functional codon con- 
version had occurred in our clones by immunoblot analy- 
sis for active RSK2 in cells exposed to the stimulus PMA 
in the absence or presence of fmk. As shown in Figure 
IF, the targeting event generated RSK2-Cys436Val mutant 
kinase that was expressed and activated similar to wild- 
type RSK2, but was rendered insensitive to fmk, as pre- 
dicted. Genomic sequencing of the RSK2-Cys436Val clone 
revealed homozygous knockin of the desired alterations at 
the RSK2 locus (Figure IG). 

Challenging knockin applications are facilitated by the use of 
2 A-coupled co-expression of fluorescent proteins and ZFNs 

We next tested the ability of our method to enrich for rare 
targeting events. First, we knockin-targeted the RSK4 lo- 
cus by transfecting K562 cells with the same reagents as 
in Figure ID. We thereafter diluted the transfected cells 
with mock-treated cells to obtain a cell pool in which the 
transfected cells constituted only ~0.5% of the total cell 
population. Unsorted cells from this pool were single-cell 
cloned, whilst another portion of the cell pool was subjected 
to FACS enrichment for cells displaying EGFP/DsRed2 
double-fluorescence followed by single-cell cloning. The 
non-sorted cell pool yielded no modified clones out of 810 
clones analysed (Figure 2 A, upper panel). In contrast, the 
FACS-enriched pool yielded 10 monoallelic (6%) and five 
biallelic (3%) knockin clones out of the 171 clones analysed, 
demonstrating that our method can enrich for rare target- 
ing events in this partial model of an inefficient transfection 
scenario. 

We next tested our method for knockin editing of ac- 
tual difficult-to-transfect cells such as human leukemic Ju- 
rkat T cells. Again, we used the reagents described above 
to edit the RSK4 locus, which was found to be triploid 
in Jurkat cells by gene copy number analysis (Supplemen- 
tary Table S6). In parallel, we transfected K562 cells with 



the same reagents and also mock-transfected both Jurkat 
and K562 cells with irrelevant mRNA not encoding fluo- 
rescent protein. Three days post-transfection, FACS profiles 
were established (Supplementary Figure S5). The profiling 
in mock-transfected cells in Supplementary Figure S5B de- 
fined the cell populations (Q3) devoid of detectable spe- 
cific fluorescence signal from transfected EGFP or DsRed2. 
Based on these data, cefl populations with specific EGFP 
signal (Ql), specific DsRed2 signal (Q4) or specific sig- 
nals for both EGFP and DsRed2 could be assigned in the 
FACS profiles in Supplementary Figure S5A. Comparing 
the various FACS profiles demonstrated a lower transfec- 
tion efficiency of Jurkat cells, as compared to K562 cells: 
thus, a much larger fraction of the Jurkat cell population 
was present in Q3. Furthermore, the expression levels were 
also lower in the Jurkat cell population, as evidenced by 
many fewer medium and highly fluorescent cells, in partic- 
ular EGFP/DsRed2-double fluorescent cells, as compared 
to the K562 cell population. Importantly, however, when 
sorting for the 15% most highly EGFP/DsRed2 double- 
fluorescent Jurkat cells prior to single-cell plating, we ob- 
served an ~8-fold increase in the isolation of clones with 
targeted knockin at the RSK4 locus, as compared to non- 
sorted cells (Figure 2A, lower panel). Moreover, sorting 
even produced one triallelic knockin clone amongst the 14 
clones analysed. 

In the challenging Jurkat cells, we also tested the ad- 
vantage of 2A-coupled co-expression of ZFN and fluores- 
cent protein as compared to non-coupled co-expression, in 
which ZFN and fluorescent protein are expressed from sep- 
arate constructs. Thus, Jurkat cells were either transfected 
with mRNA for EGFP-2A-ZFNL and DsRed2-2A-ZFNR 
or with mRNA for ZFNL and ZFNR along with a sepa- 
rate plasmid expressing EGFP. Again, the ZFNs targeted 
the RSK4 locus and the aforementioned ssODN donor 
Cys443Val/BamHI were co-transfected for knockin in this 
gene. Prior to sorting, the level of modification in the two 
cell pools was found to be similar by CEL-I assay and thus 
the separate EGFP construct did not negatively affect the 
non-coupled ZFNs (data not shown). Furthermore, FACS 
analysis showed that the fraction of EGFP fluorescent 
and EGFP/DsRed2 double-fluorescent cells were similar 
(~35%) in the two cefl pools (Supplementary Figure S6A). 
The two cell pools were then sorted for EGFP fluorescence 
and EGFP/DsRed2 double-fluorescence, respectively, and 
single cells were plated out for clonal expansion. As shown 
in Supplementary Figure S6B, 2A-coupled co-expression of 
ZFN and fluorescent protein increased the knockin edit- 
ing frequency ~6-fold as compared to non-coupled co- 
expression. Thus, 2 A-coupled co-expression produced five 
mono/biallelic and one triallelic knockin clone out of 174 
clones, whereas non-coupled co-expression produced only 
one mono/biallelic knockin clone out of 171 clones. The 
lower knockin frequencies obtained in this experiment com- 
pared to that in Figure 2A are due to the fact that we sorted 
for cells showing any fluorescence level, as compared to 
sorting for the top- 15% expressers in Figure 2A. 

We extended our approach to investigate whether it 
could enable simultaneous knockin modification of two 
genes at all alleles in the same cell. K562 cells were co- 
transfected with mRNA expressing EGFP-linked ZFNs 
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Figure 2. Fluorescent protein-2A-ZFN constructs enable efficient knockin genome editing in demanding applications. (A) K562 or Jurkat cells were 
transfected with mRNA expressing EGFP-2A-ZFNL and DsRed2-2A-ZFNR for RSK4 along with an ssODN knockin donor {rSK4 Cys443Val/BamHI). 
For K562, the cells were diluted 3 days post-transfection with mock-transfected cells to produce a pool of ~0.5% ZFN-expressing cells. Cells of this 
pool were seeded singly or subjected to FACS isolation of EGFP/DsRed2 double-fluorescent cells (any level) that were thereafter also seeded singly. For 
Jurkat, non-sorted cells or cells FACS isolated from the 15% most highly EGFP/DsRed2 double-fluorescent cell population were seeded singly 3 days 
post-transfection. In both experiments, the singly seeded cells were expanded to clonal cell lines and analysed for knockin mutation by RFLP analysis. For 
K562, data are means + range of two independent experiments. (B) K562 cells were co-transfected with mRNA expressing EGFP-2A-ZFNs (left and right) 
for RSK4 and DsRed2-2A-ZFNs (left and right) for PRMTl along with ssODN donors for knockin modification of these loci {RSK4 Cys443Val/BamHI 
and PRMTl Seal). Three days after transfection, non-sorted cells or cells FACS isolated from the 15% most highly EGFP/DsRed2 double-fluorescent 
cell population were seeded singly, expanded to clonal cell lines and analysed for complete knockin modification of the RSK4 and PRMTl loci. Data are 
summed of three independent experiments. (C) Overview of strategy for generation of cell pools with high levels of stable knockin modification. In this 
case, left and right ZFNs of a ZFN pair (ZFNL and ZFNR) have been coupled to distinct fluorescent proteins. (D) Example of close to 100% knockin in 
an MCFIOA cell pool. Specifically, MCFIOA cells were transfected for codon-conversion knockin at the RSK2 locus as described in Figure IE, (ii). Three 
days after transfection, the 40% most highly fluorescent cells were FACS isolated and cultured as a pool for 2 weeks. Thereafter, this cell pool was subjected 
to the same treatment for two more rounds. After the third round, the cell pool was analysed for stable knockin at the RSK2 locus by RFLP analysis with 
untreated MCFIOA cells serving as a control. PCR products derived from wild-type and mutant alleles are indicated by an asterisk and arrows, respectively. 
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Figure 3. pFP-ZFNL-ZFNR single vector genome editing. (A) Schematic 
of a pFP-ZFNL-ZFNR vector that co-expresses both ZFNs of a pair along 
with a fluorescent protein (FP) from the same transcript and as three sepa- 
rate entities. Functional parts indicated are as in Figure 1 A. (B) Jurkat cells 
were transfected with mRNA for a GFP-ZFNL-ZFNR construct target- 
ing the RSK2 locus. Three days after transfection, genomic DNA was iso- 
lated from non- sorted cells or from cells FACS isolated for low, medium 
or high GFP fluorescence intensities and analysed by CEL-I assay. PGR 
products containing amplicons derived from mutant RSK2 alleles or from 
wild-type alleles only are indicated by arrows and an asterisk, respectively. 
The experiment was repeated three times with similar results. N.D.: none 
detected. 

(L and R) for RSK4 as well as DsRed2-linked ZFNs (L 
and R) for PRMTl (triploid in these cells — Supplementary 
Table S6), along with ssODN donors for knockin mod- 
ification of each locus {RSK4 Cys443Val/BamHI and 
PRMTl Seal). Three days after transfection, cells were 
subjected to FACS and cells from the 15% most highly 
EGFP/DsRed2 double-fluorescent population were single- 
cell cloned. Strikingly, FACS enrichment enabled simulta- 
neous homozygous knockin mutation of the two genes at 
all five alleles in two clones out of 132, whereas no homozy- 
gously modified clones were present amongst 185 clones 
from unsorted cells (Figure 2B). 

Some cell lines and primary cells are inherently recalci- 
trant to single-cell cloning and so it may be necessary to 
create pools of cells that are highly enriched for the de- 
sired mutation. To this end, we evaluated the utility of sev- 
eral rounds of ZFN transfection and FACS on a cell pop- 
ulation to generate pools of cells with high knockin mu- 
tation levels (Figure 2C). Human immortalized, but non- 
transformed, mammary MCFIOA cells that are difficult 
to expand from single cells after FACS were transfected 



with EGFP-2A-ZFNL and DsRed2-2A-ZFNR mRNA 
for RSK2 and the aforementioned ssODN donor RSK2 
Cys436Val/SfcI. Three days after transfection, the 40% 
most highly EGFP/DsRed2 double-fluorescent cells were 
isolated by FACS, followed by 2 weeks in culture. There- 
after, this cell population was subjected to the same treat- 
ment two more times. Strikingly, after three rounds of treat- 
ment, a pool of MCFIOA cefls had been obtained with 
nearly complete targeted codon conversion at the RSK2 lo- 
cus [diploid in these cells, (34)], as evidenced by the nearly 
complete digestion in the RFLP assay performed on the cell 
pool (Figure 2D). 

Single vector fluorescent protein-ZFNL-ZFNR methods 

To further expand the utility of our approach, we generated 
variants of the Fluo-2A-ZFN construct, specifically pFP- 
ZFN constructs that can rapidly be converted into pFP- 
ZFNL-ZFNR constructs that encode both ZFN monomers 
of a pair coupled to a single fluorescent protein (Figure 3A). 
First, pFP-ZFN vectors were generated to encode GFP and 
RFP 2A-coupled to ZFNL and ZFNR, respectively (Sup- 
plementary Figure S7). To combine both ZFNs into a sin- 
gle, GFP-coupled construct, the 2A-ZFNR sequence can 
be excised from a pRFP-ZFNR vector using the restriction 
enzymes Bglll and Xhol and inserted into a pGFP-ZFNL 
construct digested with the same enzymes. Intervening 2A 
sequences ensure that each of the three functional compo- 
nents is expressed as a separate entity. We tested a pGFP- 
ZFNL-ZFNR construct for NHEJ-based modification in 
the Jurkat cell line. To this end, Jurkat cells were trans- 
fected with mRNA encoding GFP-ZFNL-ZFNR for RSK2 
(triploid in these cells — Supplementary Table S6). Three 
days later, the cells were left unsorted or were FACS isolated 
for low, medium or high fluorescence intensities roughly 
corresponding to populations 'a\ 'b' and 'd' in Figure ID). 
Subsequent CEL-I assay showed no detectable NHEJ mod- 
ification in the unsorted, low or medium fluorescent cell 
pools. By contrast, in the highly GFP positive cell pool, 
~18% of the alleles were modified (Figure 3B), demonstrat- 
ing that FP-ZFNL-ZFNR constructs work effectively. 

Off-target effects associated with FACS enrichment for cells 
with high ZFN levels 

We assessed whether enrichment for cells with high ZFN 
levels was associated with high off-target cutting. First, we 
analysed a large number of K562 knockin clones estabhshed 
from cell populations with high levels of ZFNs, specifically, 
the populations 'd /'e' for RSK4 and 'c' for RSK2, as de- 
picted in Figure IE. We only analysed clones with bial- 
lelic (complete) knockin, since these may generally repre- 
sent clones derived from cells with particularly high levels 
of ZFN activity and may therefore also be most suscepti- 
ble to off- target cutting. We queried the human genome for 
sequences with the highest similarity to the target sites of 
the RSK4 and RSK2 ZFNs, then analysed top-ranking can- 
didate off-target sites for mutations by Sanger sequencing. 
Strikingly, no mutations were detected when analysing four 
top-ranking off-target sites in ~300 clones, probing a total 
of 2372 alleles (Figure 4A). These off-target sites had four to 
seven mismatches relative to the respective on-target sites. 
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Figure 4. Off-target cutting associated with FACS for high ZFN levels in K562 cells. (A) K562 clones with biallelic knockin at the RSK4 and RSK2 
loci established from the populations '</ /'e' and 'c' in Figure IE, respectively, were analysed for mutations in four top-ranking candidate off-target sites 
(off-targets 1-4) by Sanger sequencing. The ZFN on-target site is indicated in red and mismatches in off-target sites relative to the target site are boldface. 
The spacer sequence between the ZFN monomer binding sites is in italics. (B) K562 cells were transfected with EGFP-2A-ZFNL and DsRed2-2A-ZFNR 
for RSK4 or RSK2 along with ssODN donors for either gene. Three days later, genomic DNA was isolated from the 20% most highly fluorescent cell 
population and analysed by CEL-1 assay for modification at the target site as well as the four off-target sites shown in panel (A). PGR products containing 
amplicons derived from mutant alleles or PGR products derived from wild-type alleles only are indicated by the arrows and asterisks, respectively. The 
experiment was repeated three times with similar results. N.D.: none detected. 



The absence of off-target mutagenesis in clones derived 
from cells with high levels of above ZFNs was confirmed by 
complementary analyses. Thus, CEL-I assays failed to de- 
tect mutations in any of the above sites in pooled genomic 
DNA isolated from highly double-fluorescent cell popula- 
tions 3 days after transfection with EGFP-2A-ZFNL and 
DsRed2-2A-ZFNR for RSK4 or RSK2 along with the 
aforementioned ssODN donor for either gene (Figure 4B). 
Neither did CEL-I assays detect any mutations in an addi- 
tional six top-ranking candidate off-target sites with six and 
seven mismatches relative to the on-target site of the RSK4 
and RSK2 ZFNs, respectively (Supplementary Figure S8). 

We further assessed the issue of off-target cutting by 
analysing a number of CHO cell lines with knockout of 
enzymes in the protein glycosylation pathway that we gen- 
erated using our method. The clones had biallelic, NHEJ- 
derived knockout of either of four genes and were estab- 
lished from the top 2-18% most highly double-fluorescent 
cell populations after transfection with the relevant EGFP- 
2A-ZFNL and DsRed2-2A-ZFNR constructs (data not 
shown). Only one monoallelic off-target mutation was iden- 
tified by Sanger sequencing of various top-ranking off- 
target sites for the four ZFNs in a number of clones, with a 



total of 94 alleles probed (off-target 1 for B4GALT4 ZFNs; 
Supplementary Figure S9A and C). 

CEL-I assay on cell pools 3 days after transfection with 
B4GALT4 ZFNs confirmed that these ZFNs had a propen- 
sity to cut off-target site 1 upon sorting for high expres- 
sion levels, but with a lower efficiency than the on-target 
site (Supplementary Figure S9B). We therefore also anal- 
ysed modification of both these sites in cells FACS en- 
riched for medium ZFN levels to test if it was possible to 
generate a cell population with an even more favourable 
on/off-target mutagenesis ratio. Interestingly, in the popu- 
lation with medium ZFN levels, detectable off-target cut- 
ting was eliminated, whilst the on-target modification re- 
mained high. 



CRISPR/Cas9-based genome editing is facilitated using the 
2A-coupled fluorescent protein/FACS enrichment strategy 

To explore our method in relation to other nucleases used 
for genome editing, we generated a construct expressing 
Cas9-2A-GFP under control of the CMV promoter as well 
as the gRNA sequence for the target gene driven by the 
U6 promoter (pCMV-Cas9-FP; Figure 5A). K562 cells were 
transfected with this construct encoding gRNA targeting 
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Figure 5. CRISPR/Cas9 genome editing is enhanced using the 2A-coupled fluorescent protein/FACS strategy. (A) Schematic of a construct, pCMV- 
Cas9-FP, that co-expresses gRNA and Cas6-2A-fluorescent protein. Functional parts indicated are U6 promoter, guide (g)RNA, CMV promoter, Cas9 
nuclease, 2 A peptide and fluorescent protein (GFP). (B)-(D) K562 cells were transfected (B) with a Cas9-GFP construct targeting the KRAS locus, (C) 
with a Cas9-GFP construct targeting the VEGFA locus or the same construct containing a nickase version of Cas9 (Cas9N) or (D) with such constructs 
targeting the EMXl locus as a nickase pair or as individual nickases coupled to GFP or dsRed2. An ssODN donor (EMXl gRNA 1 /2 EcoRV) for knockin 
of an EcoRV restriction site in EMXl was co-transfected. Three days after transfection, genomic DNA was isolated from non-sorted cells or from cells 
FACS isolated for low, medium or high fluorescence intensities and analysed by RFLP or CEL-I assay, as indicated. In all experiments, repeated three 
times with similar results, PGR products containing amplicons derived from mutant alleles or from wild-type alleles only are indicated by the arrows and 
asterisks, respectively. N.D.: none detected. 



KRAS. After three days, the cells were left unsorted or 
were FACS isolated for low, medium or high GFP expres- 
sion levels and subsequently assayed for NHEJ modifica- 
tion of the KRAS locus (triploid in K562 cells; Supplemen- 
tary Table S6). In unsorted cells, ~15% of the alleles were 
modified, which increased to ~24%, ~41% and ~59% in 
cells displaying low, medium and high fluorescence levels. 



respectively (Figure 5B). Qualitatively similar results were 
obtained with the same construct in which the gRNA tar- 
geted the VEGFA locus (Figure 5C, upper panel), which is 
tetraploid in K562 cells (Supplementary Table S6). Thus, 
2A-linked co-expression of fluorescent protein and Cas9 
combined with FACS can also be used to effectively increase 
genome editing frequencies in the CRISPR/Cas9 system. 
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However with certain gRNAs, the CRISPR/Cas9 system 
has recently been found to exhibit significant off-target cut- 
ting (3 5-3 8). These include the particular VEGFA gRNA 
used here, which was found to target additional genomic 
sites with relatively high efficiency (36). We analysed two of 
these sites and found that their modification also increased 
upon FACS for high Cas9 expression levels, one of them to 
the same extent as the on-target site (Figure 5C, middle and 
lower panels). 

To increase the utility of our method for CRISPR/Cas9- 
based editing, we tested whether it functions in conjunc- 
tion with CRISPR/Cas9 nickase mutant pairs, which ex- 
hibit enhanced specificity, partly due to low indel mutagen- 
esis rates by the individual nickases of the pair as compared 
to CRISPR with wild-type Cas9 (12-14). Accordingly, we 
demonstrated that the on- as well as off-target NHEJ muta- 
genesis by the VEGFA CRISPR/Cas9 construct observed in 
the top-expressing cells by CEL-I assays was non-detectable 
with the DlOA mutant nickase version of Cas9 (Cas9N (9)) 
(Figure 5C, last lane in all panels). Thus, the use of Cas9N 
can reduce the level of off-target modification associated 
with FACS for high-level Cas9 expression. 

To incorporate Cas9N into our method, we generated 
CRISPR/Cas9N pairs, in which one construct contained 
Cas9N that was 2A-coupled to GFP and the other construct 
contained Cas9N 2A-coupled to DsRed2. K562 cells were 
co-transfected with such a pair harbouring gRNA pairs 
previously used to target the EMXl locus (12). To further 
assess our method for knockin editing, we co-transfected 
an ssODN donor designed to introduce an EcoRV site in 
the locus. Three days post-transfection, the cells were left 
unsorted or were FACS isolated for low, medium or high 
GFP/DsRed2 double-fluorescence levels. As we observed 
for ZFN pairs, genome modification levels increased with 
increasing fluorescence levels present in cells transfected 
with CRISPR/Cas9N pairs (Figure 5D). In the most highly 
fluorescent cefl population, ~11% of alleles of the EMXl 
locus (triploid in K562 cells; Supplementary Table S6) were 
knockin modified, as evidenced by RFLP assay. Total mod- 
ification rates were ~23%, as evidenced by CEL-I assay, in- 
dicating NHEJ mutations in addition to the knockin modi- 
fication. By contrast, no mutagenesis was detectable in top- 
expressing cells after transfection with only one member of 
the CRISPR/Cas9N pair (Figure 5D). Qualitatively similar 
results were obtained with another CRISPR/Cas9N pair, in 
which one of the gRNAs was swapped for another EMXl 
target sequence (Supplementary Figure SIO). 



DISCUSSION 

We have developed a method for genome editing via 2A- 
coupled co-expression of nuclease and fluorescent protein 
followed by FACS enrichment of cells with high nucle- 
ase levels. In the several and diverse variations tested, the 
method was found to greatly increase genome editing effi- 
ciencies of ZFNs as well as CRISPR/Cas9. In most cases, 
this occurred in a dose-dependent manner: the higher nucle- 
ase levels sorted for, the higher genome editing frequencies 
achieved, both for NHEJ mutagenesis and for knockin edit- 
ing. 



Our data agree with the finding in the model system 
of Certo et al (21) that increased levels of nuclease re- 
sult in increased levels of mutagenesis. In the present study, 
we mainly delivered donors as ssODNs and nucleases as 
mRNA. Whilst high knockin levels may also be achieved 
with plasmid constructs for donors and nucleases, it is note- 
worthy that the herein used delivery forms work effectively 
in our method, since ssODNs and mRNA have several ad- 
vantages: first, ssODN donors can be obtained fast and in- 
expensively. Second, nuclease expression from mRNA is in- 
dependent of issues relating to the efficiency, or lack thereof, 
of a given plasmid promoter in the targeted cell type. Third, 
when using ssODN and mRNA delivery, editing can be per- 
formed entirely without double-stranded DNA, thus min- 
imizing the risk of random genomic integration of donor 
and nuclease constructs, which by the nature of our method 
are introduced in very high numbers when sorting for the 
most efficiently transfected cells. Finally, we consider it 
Hkely that the use of ssODN donors contributes to the high 
knockin frequencies obtained in several of the present ex- 
periments. Accordingly, we previously found that ssODNs 
are more efficient than plasmids as donor in side-by-side 
comparisons (15). The reason may be that the small size 
of ssODN donors (~100 bp) compared to plasmid donors 
(4-6 kb) permits inclusion of a greater numerical level of 
donor constructs in the transfection reaction, which should 
ultimately increase knockin frequencies. This notion would 
agree with the study by Certo et al (21), showing that high 
levels of donor promotes knockin editing at the expense of 
NHEJ. Recently, fluorescent proteins have also been 2A- 
coupled to TALENs followed by FACS for transfected cells 
(39). Whilst this study did not enrich for specific nuclease 
levels, it demonstrates that the fluorescent protein-2A strat- 
egy also works for TALENs, which may therefore similarly 
be used in the various enrichment protocols described here. 

One advantage of 2A-coupled co-expression of nuclease 
and fluorescent protein is that FACS enrichment for flu- 
orescent cells guarantees that these cells also took up the 
nuclease construct. For nucleases acting as a pair, the cou- 
pling of each monomer to a distinct fluorescent protein fur- 
thermore ensures that double-fluorescent cells took up both 
monomers of the pair. The benefit of the dual labelling was 
particularly evident in the Jurkat cells, where a substantial 
fraction of cells appeared to express only one monomer of 
the ZFN pair. Whilst both monomers need be present in 
the cell, a 1:1 expression ratio may not be optimal for all 
ZFN pairs (40). For such ZFN pairs, different amounts 
of individual monomer constructs may be transfected, fol- 
lowed by sorting for a population in which the individual 
monomers are present both at optimal relative levels and 
at as high levels as possible. The approach of 2A-coupling 
both ZFNs of a pair to the same fluorescent protein, as in 
pFP-ZFNL-ZFNR constructs, is another means to ensure 
that FACS will only enrich for cells that have taken up both 
ZFNs of a pair. This single vector strategy might be par- 
ticularly useful for multiplex editing, in which ZFN pairs 
for several different genes could each be linked to a distinct 
colour for simultaneous multi-colour FACS enrichment of 
cells in which several genes may have been modified. Fi- 
nally, 2A-coupled constructs have the advantage that a 'nor- 
mal' fluorescence signal for a particular cell type can reveal 
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that the co-encoded nuclease construct is of good quality, 
i.e. not degraded, a control that may be particularly use- 
ful with labile RNA nuclease constructs. By contrast, non- 
coupled co-expression of the fluorescence marker via a sep- 
arate construct cannot guarantee that fluorescent cells have 
also taken up the co-transfected nuclease. Such potential 
discrepancy between fluorescence signal and nuclease up- 
take may be part of the reason why we observed higher 
genome editing frequencies with 2A-coupled co-expression 
of the fluorescent protein and ZFN than with non-coupled 
co-expression. 

In the surrogate gene reporter strategy, nucleases are 
co-transfected with a reporter that constitutively expresses 
RFP to mark transfected cells and conditionally expresses 
GFP upon indel mutagenesis of a nuclease target site placed 
upstream of the GFP (20), thereby marking cells express- 
ing active nuclease. The authors showed that FACS can 
thereafter be used to successfully isolate cell populations en- 
riched for active nuclease and NHE J-based mutation of the 
chromosomal target gene. Compared to the surrogate re- 
porter strategy, the present method allows for more donor 
in the transfection reaction, can be performed without the 
use of double-stranded DNA, has more fluorescence mark- 
ers available for multi-plexing (since none are used for a re- 
porter construct) and does not require construction of a sur- 
rogate reporter target site. Furthermore, since fluorescence 
signal in the 2A-coupled approach is a more direct measure 
of nuclease levels, it may be easier to balance nuclease levels 
for an optimal on/off-target ratio using the present method, 
as discussed below. 

Although ZFNs are rather specific reagents, off-target 
cleavage can occur and may increase at elevated levels of 
nuclease (33,41). However, for the most exhaustively char- 
acterized ZFNs in this study {RSK4 and RSK2), we found 
no mutations in top candidate off- target sites in cells en- 
riched for very high ZFN levels, using a depth of analysis 
that should detect off-target rates occurring with >1% on 
average. Furthermore, analysis of a number of CHO knock- 
out clones generated using our method also suggested very 
modest off-target effects. It should be noted, however, that 
the extent of off- target cutting is inherently dependent on 
ZFN design. The ZFNs used here had no predictable off- 
target sites with less than four nucleotide mismatches com- 
pared to the on-target site. Furthermore, the target sites 
were 30-33 bps long in total. It is possible that higher off- 
target levels will be observed in top-expressing cells if ZFNs 
with off-target sites containing fewer mismatches and/ or 
having shorter target sites are used. Both of these features 
promote off-target cutting and were features of several of 
the ZFNs used in the aforementioned studies on the off- 
target activity of ZFNs. It should also be noted that we 
only analysed predictable off-target sites (i.e. homologous 
off-target sites), and thus did not address potential off- 
target actions at non-homologous sites, which would re- 
quire whole-genome sequencing or other unbiased genome- 
wide strategies (33). Nevertheless, at this point our data sug- 
gest that with ZFN designs similar to those used here, off- 
target effects associated with our method are very modest 
and that it seems easy to derive clones with no modifica- 
tions in predictable off-target sites. 



Interestingly, in the one case in which off-target mutagen- 
esis occurred at high ZFN levels, it was possible to choose a 
cell population with medium ZFN levels, which eliminated 
detectable off-target cutting, whilst maintaining high rates 
of on-target modification. We speculate that mismatches in 
off-target sites render their cutting more sensitive to de- 
creasing ZFN concentrations than the on-target-site, a no- 
tion supported by ZFN off-target analyses in vitro (41), such 
that nuclease levels can be obtained where on-target cut- 
ting is relatively much higher than the off-target event, as 
compared to saturating doses of nuclease. With the current 
knowledge on ZFN action it is not possible to predict a pri- 
ori an optimal ZFN expression level showing the desired on- 
and off-target cutting ratio. This must be determined on a 
case-by-case basis, for example, by performing experiments 
like the one shown in Supplementary Figure S9B. Such ex- 
periments may be able to identify a cell population with a 
suitable nuclease level, which may subsequently be used to 
generate modified cell clones or cefl pools. 

Regarding the CRISPR/Cas9 system, the VEGFA con- 
struct used here produced increased cutting at certain off- 
target sites upon FACS for high nuclease levels. These find- 
ings agree with previous studies that some gRNAs, in- 
cluding the present VEGFA gRNA, exhibit significant off- 
target actions that may increase with increasing amounts 
of gRNA (35-38). Although we did not test the possibil- 
ity, in some cases our method might be used to balance 
CRISPR/Cas9 on- and off-target effects in the same man- 
ner we showed for ZFNs. However, this will not be possi- 
ble for certain gRNAs, as exemplified in the present case 
with VEGFA, in which the on-target site and off-target site 
1 were mutated to the same extent at the various expres- 
sion levels. It is possible that more careful design and/or 
new developments in the design of gRNA, such as de- 
scribed in (13), may ameliorate the off-target issue of our 
method applied with CRISPR/Cas. However, it is notewor- 
thy that our method also increased the genome editing ef- 
ficiency of CRISPR/Cas9N nickase pairs, a version of the 
CRISPR/Cas9 system that shows greatly improved speci- 
ficity (12-14). The individual CRISPR/Cas9Ns of a pair 
will likely show increased off- target nicking upon FACS 
enrichment for high Cas9N levels. However, since DNA 
single-strand breaks are predominantly repaired by the 
high-fidelity base excision repair pathway (42), off- target 
nicks will rarely be mutagenic. Accordingly, we detected 
no indel mutagenic effects of individual CRISPR/Cas9Ns 
in cells sorted for high-level expression of such constructs. 
Another reason for the higher specificity of CRISPR/Cas9 
nickase pairs is a combined target sequence that is twice as 
long as that of an individual CRISPR/Cas9 construct. Al- 
together, nickase pairs seem a particularly safe version of 
the CRISPR/Cas9 genome editing system for use in our 
method. 

We primarily validated our method by assessing genome 
editing in stably modified clonal cell lines or stable cell 
pools. This is opposed to assessing editing in genomic 
DNA from cell pools 3 days after transfection with edit- 
ing reagents, as is often performed, but which may not 
be a reliable indicator of stable modification rates. For in- 
stance, in our hands and in the hands of others, modifica- 
tion rates observed 3 days post transfection can be many- 
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fold higher than those eventually observed in stably mod- 
ified cells (15,20). Altogether, we vaHdated our method in 
a very large set of clonal cell lines (>4000), of which 426 
were monoallelic and 648 were biallelic knockin clones. This 
also highlights that in some cases our method can serve 
as a high-throughput strategy for generation of targeted 
knockin clones. The ability to generate many clones with 
the same mutation may be a powerful means to minimize 
the confounding impact of clonal variation on downstream 
functional analyses, a problem inherent to most studies on 
clonal cell lines. For example, 25 knockin clones may be gen- 
erated, pooled and subjected to phenotypic analysis, a strat- 
egy that may effectively dilute out non-nuclease-derived 
phenotypic differences of individual clones in the overall re- 
sults achieved when studying the clone pool. A similar ben- 
efit may be obtained by studying cell pools with close-to- 
complete genome modification via consecutive rounds of 
ZFN transfections and FACS on a cell population, as we 
also found feasible in the present study. 

SUPPLEMENTARY DATA 

Supplementary data are available at NAR Online. 
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